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A SYSTEM FOR MONITORING NON-COINCIDENT, 
NONSTATIONARY PROCESS SIGNALS 

[0001] This invention was made with government support under Contract No. W-3 1- 
109-ENG-38 awarded to the Department of Energy. The Government has certain rights 
in this invention. 

BACKGROUND OF THE INVENTION 

[0002] This invention relates generally to a system for monitoring non-coincident, 
nonstationary process signals. More particularly, this invention relates to a system for 
monitoring non-coincident, nonstationary process signals used in detecting deficiencies in 
various stages of manufacturing processes, biological process and the like. 
[0003] There is often a need or desire to monitor finite length, non-stationary signals 
that may include repetitive deterministic artifacts that are non-coincident in time. This 
phenomenon occurs, for example, in many engineering systems that contain moving parts 
that are monitored by digitizing sensors monitoring signals relevant to the quality of 
those parts. 

[0004] For example, an assembly line where the thickness of manufactured plastic or 
metal components might be measured. In such an example, every component passing 
through the sensor produces a signal that has a shape that is substantially similar to the 
preceding signal — but the signal may be longer or shorter depending upon the speed of 
the conveyor belt. Another example would be the force applied to the die set in a metal 
stamping machine. Once again, a signal representing this force would possess a similar 
shape with every repetition of the machine's movement. The length of the force signal, 
however, may be longer or shorter depending upon how fast the machine is operating. 
Biological signals may also produce signals with repetitive deterministic artifacts. One 
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such example includes the use of cardiac signals from a biological heart monitored from 
EKG traces. 

[0005] In each of the foregoing cases, if one were to digitize and then plot the 
monitored signals, the length of the repetitive deterministic artifacts would vary from part 
to part or from cycle to cycle, depending upon the speed and variability of the system or 
organism being monitored. A reference signal can often be used to compare to these 
repetitive signal waveforms for detection of anomalies, but only if their lengths are 
exactly the same. If their lengths are not the same, large discrepancies between the 
reference signal and the input signal would be seen due to the signals not being 
coincident. Such discrepancies could result in an erroneous diagnosis. 

SUMMARY OF THE INVENTION 

[0006] It is therefore an object of the invention to develop an improved method for 
monitoring non-coincident and non-stationary process signals. 
[0007] It is a further object of the invention to develop an improved system for 
monitoring non-stationary, non-coincident process signals of a definite length. 
[0008] It is yet another object of the invention to develop an improved system for 
monitoring non-coincident, non-stationary process signals that correspond to a 
manufacturing process. 

[0009] It is yet another object of the invention to develop an system for monitoring 
non-coincident, non-stationary process signals that correspond to a biological process^ 
such as signals emanating from a biological heart . 

[0010] In accordance with the above objects, a system is provided including a series of 
steps for developing a reference and for characterizing an input signal or signals for 
meaningful comparison with the reference. The first step includes the use of a training 
sequence for determining a mean and variance of a reference wave form and to define a 
reference wave form length. The leading and falling edges of the repetitive deterministic 
artifacts are determined in the monitored signal and to calculate the sample length. The 
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monitored signal is then resampled to properly correlated with the reference signal, and 
the two signals are arranged such that they are coincident in time. The monitored signal 
is then shifted with respect to the sequence signal so that the monitored signal has the 
same number of samples as the reference length identified in the first stsep. The adjusted 
monitored signal is then compared to the stored reference signal. 
[0011] These and other objects, advantages and features of the invention together with 
the organization and manner of operation thereof will become apparent from the 
following detailed description when taken into conjunction with the accompanying 
drawings wherein like elements have like numerals throughout the drawings described 
below. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0012] FIGURE 1 is a depiction of the overall operation of an example system of the 
present invention; 

[0013] FIGURE 2 is an illustration of detailed training module steps for the system of 
FIG. 1 according to one embodiment; 

[0014] FIGURE 3 illustrates the detailed monitoring module steps for the system of 
FIG. 1; 

[0015] FIGURE 4A shows an example of a raw data set from a repetitive process; and 
FIG. 4B shows the raw data sequence smoothed using the Savitzky-Golay filter; 
[0016] FIGURES 5A-5D show each of the identified signatures as they have been 
identified and extracted from the original data stream; 

[0017] FIGURE 6 shows the basic methodology for re-sampling using a digital 
fractional re-sampling filter; 

[0018] FIGURE 7 is a depiction of the logic diagram for an expert pump-surveillance 
system operated in accordance with an embodiment of the invention; 
[0019] FIGURE 8 is a representation of an expert system for online surveillance of a set 
of nuclear reactor coolant pumps; 
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[0020] FIGURE 9 is an illustration of a flow diagream of a bounded angle ratio test 
method of data analysis; 

[0021] FIGURE 10 is an illustration of conditions and values for carrying out a 
bounded angle ratio test; 

[0022] FIGURE 11 is an illustration of conditions for comparing similarity of two 
points X0 and XI on the illustration of FIGURE 10; 

[0023] FIGURE 12 shows EBR-II channel 1, primary pump 1, power under normal 
operational conditions, and modelled BART; 

[0024] FIGURE 13 shows EBR-II channel 2, primary pump 2, power under normal 
operational conditions, and modelled BART; 

[0025] FIGURE 14 shows EBR-II channel 3, primary pump 1, speed under normal 
operating conditions and modelled BART; 

[0026] FIGURE 15 shows EBR-II channel 4, primary pump 2, speed under normal 
operating conditions and modelled BART; 

[0027] FIGURE 16 shows channel 5 reactor outlet flow rate under normal operating 
conditions and modelled BART; 

[0028] FIGURE 17 shows EBR-II channel 6, primary pump 2, flow rate under normal 
conditions and modelled BART; 

[0029] FIGURE 18 shows EBR-II channel 7 subassembly outlet temperature 1 Al under 
normal operating conditions and modelled BART; 

[0030] FIGURE 19 shows channel 8 subassembly outlet temperature 2B1 1 under 
normal operating conditions and modelled BART; 

[0031] FIGURE 20 illustrates channel 9 subassembly outlet temperature 4E1 under 
normal operating conditions; 

[0032] FIGURE 21 illustrates channel 10 subassembly outlet temperature 4F1 under 
normal operating conditions and modelled BART; and 

[0033] FIGURE 22A shows an EBR-II primary pump power signal with an imposed 
positive drift; FIGURE 22B shows an application of SPRT to the signal of FIGURE 22 A; 
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FIGURE 22C shows an EBR-II primary pump power signal with an imposed positive 
step function; FIGURE 22D shows an application of SPRT to the signals of FIGURE 
22C; FIGURE 22E shows an EBR-II primary pump power signal with an imposed 
sinusoidal disturbance; FIGURE 22F shows an application of SPRT to the signal of 
FIGURE 22E. 

DETAILED DESCRIPTION OF THE INVENTION 

[0034] In order to illustrate embodiments of the invention, wherein the monitored 
signal and the reference signal comprise a repetitive waveforms, an explanation is 
provided to generally describe the methodology and function for the systematic 
procedure of the invention and then the stepwise algorithmic approach is presented in 
detail. Although the manner in which the phenomena are described is one rigorous 
approach which explains the operation of the invention for those skilled in the art, other 
conventional mathematical and theoretical explanations can also be used to describe 
similar results which characterize embodiments of the invention. The invention is 
therefore not limited to the description of its operation by the following illustrative 
mathematical explanations. 

[0035] The present invention involves the use of a step-wise procedure for monitoring a 
plurality of repetitive signals. FIG. 1 depicts the overall operation of one embodiment of 
a system of the present invention. The system runs on a computer or is embedded into 
monitoring hardware 201. Before data are analyzed using the system, a training data 
source must be selected, shown at 202. The selection can be an on-line or real-time 
source 204, or it can be a storage media source 203. Once the source has been selected, 
data are collected for building the trained reference patterns, shown at 205, and the results 
are stored at 206. The training data are fed into the training module, shown at 208 and all 
pertinent parameters and reference patterns are calculated. 
[0036] After the training process completes steps 201-206, the data source for 
monitoring is selected, shown at 207. Again, the selection can be an on-line or real-time 
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source, shown at 204, or it can be a storage media source, shown at 203. Next, data are 
acquired for analysis, shown at 208, from the chosen source. The acquired data are fed as 
input to the system monitoring module, shown at 209, which determines whether or not 
the input data are deviating from the trained normal conditions. The results from the 
monitoring module are directed at 210 to one or both of a data logging system, shown at 
211, and/or a terminal display or monitoring indication mechanism, represented at 212. 
[0037] FIG. 2 illustrates the detailed training module steps for one form of a system of 
the present invention. First, a training data source is selected at 213. The selection can 
be an on-line or real-time source, shown at 215, or it can be a storage media source, 
shown at 214. The data are read into memory either via data acquisition (DAQ) hardware 
or from storage media at 216 depending on the choice made in step 213. 
[0038] The first data processing step, shown at 217, is a method for determining the 
leading and trailing edges of each individual signature in the input. An example of this 
procedure is illustrated in FIGS. 4A-4B and 5A-5D. FIG. 4A shows an example of a raw 
data set from a repetitive process. The threshold is used to mark the leading edge and 
trailing edges of each of the four signatures in the data sequence. Because the data is 
noisy, unique identifiers for the leading and trailing edges are impossible to find using 
this threshold. One method of overcoming this problem, however, is illustrated in FIG. 
4B. The raw data sequence is smoothed using a well-known smoothing algorithm called 
the Savitzky-Golay filter. Much of the noise is suppressed using the Savitzky-Golay 
filter so that the threshold can be used effectively to identify the leading and trailing 
edges of each signature. The markers in FIG. 4B show where each of the edges was 
identified. FIGS. 5A-5D show each of the identified signatures as they have been 
identified and extracted from the original data stream. 

[0039] The next step in the training procedure is to store a plurality of identified 
signatures in computer or embedded memory 218. As each signature is extracted from 
the training data set its sample length is measure and stored as well, shown at 219. Then 
a reference length N re f is calculated from all of the measured signature lengths at 220. 
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The reference length can be determined from the minimum, maximum, median or mean 
of the plurality of measured signature lengths. 

[0040] The reference length N re f is used to determine the re-sampling rate applied to 
each stored signature so that the lengths of all signatures are the same, represented at 221. 
The re-sampling is accomplished using a digital fractional re-sampling filter. The basic 
structure of the filter is shown in FIG. 6. If the raw input signature or data sequence 
representated by x(n) at 243 has an original length of N, then the signature is re-sampled 
using the re-sampling filter to produce a new signature of length N re f. First x(n) is fed 
through an expander at 244 that inserts N re f zeros between each original sample. Then a 
low-pass anti-aliasing filter 245, is applied to the resulting zero padded data sequence 
acting as an interpolator. The interpolated sequence is then decimated at 246 by a factor 
of N to produce the desired length of N re f for the output signature y(n). In cases where N 
and N re f are large, may be more efficient to first simplify the ratio N re f /N to their 
equivalent ratio of smallest integer (i.e., 40/30 = 4/3). 

[0041] In step 222 the re-sampled signatures are padded on both sides with a plurality 
of zeros. Each new re-sampled signature is compared with all previously processed 
signatures using a vector similarity calculation defined to be between 0 and 1 (1 for 
identical and 0 for no similarity) at step 223. The new signature is shifted forward 
and/or backward until the similarity is maximized, ensuring that the signatures optimally 
line up with one another. After the signatures have been lined up, the extraneous samples 
on both ends of the signature are removed at 224. 

[0042] The next step in the training process is to calculate the mean and standard 
deviation for each sample in the N re f length signatures producing N re f mean values and 
N re f standard deviation values, shown at 225. The parameter N re f and the vectors of mean 
values and standard deviation values are stored for use during the monitoring phase of 
operation at 226 and the training is completed at step 227. 

[0043] FIG. 3 illustrates the detailed monitoring module steps for the system of the 
present invention. First, a monitored data source is selected at 228. The selection can be 
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an on-line or real-time source, shown at 230, or it can be a storage media source, 
represented at 229. Next the training parameters; mean values standard deviation 
values (a), and reference length N re f, are loaded into memory at 23 1 and the monitored 
data are read into memory either via data acquisition (DAQ) hardware or from storage 
media depending on the choice made in step 228. 

[0044] The leading and trailing edges of each signature present in the monitored data 
are identified sequentially at 233 using the procedure described during the training phase 
step 217. The signatures are re-sampled to equalize their lengths in step 234 using the 
same procedure as in step 221. The similarity optimization at 235, 236 and 237 is used to 
line the monitored signature with the reference mean, calculated during step 225 of the 
training phase. A number of similarity measurement techniques may be used. In one 
embodiment of the invention, a bounded angle ratio test (BART) is used as the similarity 
measurement technique. The BART system is discussed in detail in U.S. Patent 
Application No. 09/373,326, incorporated herein by reference. It is also possible to use 
other systematic methods for the third step 520. For example, one could measure the 
distance between two Euclidean vectors as a possible technique. The details of the most 
preferred BART measurement technique are described below. 

[0045] The re-sampled and lined up signature is then differenced with the mean value 
vector to produce the residual vector R in step 238. In a particular embodiment of the 
invention, this is accomplished using a non-stationary sequential probability ratio test 
(SPRT). The SPRT system is discussed in detail in U.S. Patent No. 5,223,207, and 
incorporated herein by reference. A SPRT decision ratio is then calculated to determine 
whether the monitored signal falls outside of normal operating conditions. This 
monitoring procedure can continue in real-time for the remainder of the operating run. 
Alternatively, the procedure can continue until a user decides to retrain the automated 
system. 

[0046] Parameter settings for the detection engine 240 are set manually before 
monitoring begins or are loaded from a stored data file that can be used over and over at 
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step 239. The results of the detection engine 240 are then processed in step 241 to 
determine the amount of deviation in the monitored signatures from the trained reference 
signature. The processing step produces an alert if the deviation is greater than a user 
specified amount (SFMp-positive deviation, SFMn-negative deviation, SFMs-standard 
deviation change) with a confidence level determined by specified false (a) and missed 
alarm (p) probabilities. The alert is then logged and/or displayed in the final step of the 
monitoring process at 242. 

[0047] As described above, a non-stationary sequential probability ratio test (SPRT) is 
preferably used to compare the adjusted monitored signal to the stored reference signals. 
In one example of the method, SPRT teaches a expert system and method to determine 
the degradation of nuclear reactant coolant pumps and their respective sensors prior to 
failure. 

[0048] FIG. 8. illustrates the architecture of the expert system for an online pump- 
surveillance system. The two coolant pumps 1 and 2 are each equipped with numerous 
sensors 3-6. A typical sensor arrangement is depicted in FIG. 8 where seven sensors are 
employed: three sensors 3 which monitor the rotor shaft speed, two accelerometers 4 
which monitor the mechanical vibration of the pump, a pump power measuring device 5 
which measures the power needed by the motor to turn the rotor, and a discharge pressure 
transducer 6 which measures the flow rate of coolant through the pump. The information 
from the sensors 3-6 is transmitted to the data acquisition system 7 (DAS) which then 
interfaces with the artificial intelligence (AI) based inference engine 8. The AI inference 
engine 8 implements an operability logic algorithm illustrated in FIG. 7. The AI software 
for the inference engine 8 is supported by a layer of utility routines which perform 
generic functions such as loading external tables, providing access to shared knowledge 
base, activating interprocess synchronization, and performing network communication. 
Output from the AI engine 8 is integrated to a color-graphics display 9 in the reactor 
room and is multiplexed back to the data acquisition system 7 for archive backup storage. 
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If the inference engine 8 detects a degradation in the pump or its sensors, a pump sensor 
failure alarm 10 or a pump disturbance alarm 1 1 is sounded. 
[0049] FIG. 7 illustrates a flow chart for determining the condition of the cooling 
pumps through the employment of a sequence of mathematical algorithms associated 
with a series of sequential probability ratio test (SPRT) modules. The input signals 12 
are acted on mathematically by a sensitive pattern recognition technique, the sequential 
probability ratio test (SPRT). The use of the SPRT technique through several "if-then" 
steps provides for early annunciation of sensor operability or degradation of the coolant 
pump. Each of the modules 13, 14, 15, and 16 employs the SPRT technique to determine 
the condition of the respective sensors for the purpose of determining whether a problem 
is sensor or pump related. The modules present in the expert system include a shaft 
speed SPRT module 13, a vibration level SPRT module 14, a power signal SPRT module 
15, and a discharge pressure SPRT module 16. Each SPRT module is connected to an 
audible alarm 1 7 which is sounded when a sensor degradation is determined. If no sensor 
degradation is determined the degradation is determined to be due to the pump, and the 
pump disturbance alarm 1 1 sounds. 

[0050] The various recited SPRT modules monitor and compare the signals from two 
similar sensors which respond to a single parameter representing a physical condition 
associated with the pump. The purpose of this comparison is to identify subtle changes in 
the statistical quality of the noise associated with either signal when compared one to the 
other. In applications involving two or more reactor coolant pumps equipped with 
identical sensors, a SPRT monitor applied to the pumps will provide a sensitive 
annunciation of any physical disturbance affecting one of the pumps. If each of the 
pumps had only one sensor, it would be difficult for the SPRT technique to distinguish 
between a pump degradation event and a degradation of the sensor itself. However, when 
each pump is equipped with multiple, redundant sensors, the SPRT technique can be 
applied to pairs of sensors on each individual pump for sensor-operability verification. 
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[0051] As is illustrated in the logic diagram of FIG. 7, the expert system is synthesized 
as a collection of "if-then" type rules. Each SPRT module processes and compares the 
stochastic components of the signals from two sensors that are ostensibly following the 
same physical process. If any physical disturbance causes the noise characteristics for 
either signal to change, that is, a larger variance, skewness, or signal bias, then the SPRT 
technique provides a sensitive and rapid annunciation of that disturbance while 
minimizing the probabilities of both false alarms and missed alarms. 
[0052] The processor 18, of module 13, first interrogates the signals Nl and N2, 
representing the mean shaft speed for the coolant pumps 1 and 2, respectively. The mean 
shaft speed signal is obtained by averaging the outputs of the three RPM sensors 3 on 
each of the pumps 1 and 2. If a problem is identified in the comparison of Nl and N2, a 
sequence of SPRT tests is invoked to validate the three sensors on the pump 1, signified 
by Al, Bl, and CI . If one of those sensors is identified as degraded, an audible alarm 1 1 
is actuated. If the three sensors on pump 1 are found to be operating within tolerance, 
then the three corresponding sensors on the pump 2 are tested. If all six sensors are 
confirmed to be operational, execution is passed to the next SPRT module which in this 
case is the SPRT module 14 which tests the vibration-level variable. If these sensors are 
found to be operational, then the testing is functionally shifted to the module 15 the 
power-signal variable, and then if it is found to be functioning properly to the module 16 
the discharge-pressure variable. This sequential organization is illustrated in FIG. 7. If a 
problem is identified in any module, an audible alarm, 10, 1 1 or 17 is sounded in the 
reactor control room, and the operator can initiate a manual shutdown of the reactor to 
repair the identified problem. 

[0053] The objective of the AI engine in the expert system is to analyze successive 
observations of a discrete process Y which represents a comparison of the stochastic 
components of two physical processes monitored by similar sensors. Let yk represent a 
sample from the process Y at time t. During normal operations with an undergraded 
physical system and with sensors that are functioning within specifications, the ykj should 
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be normally distributed with means 0. If the two signals being compared do not have the 
same nominal means due, for example, to differences in calibration, then the input signals 
will be pre-normalized to the same nominal mean values during initial operation. 
[0054] The specific goal of the Al engine is to declare system 1 or system 2 degraded if 
the drift in Y is sufficiently large that the sequence of observations appears to be 
distributed about means + M or - M, where M is a preassigned system distribution 
magnitude. The SPRT provides a quantitative framework that enables us to decide 
between two hypotheses, H and H2, namely: 

[0055] HI : Y is drawn from a Gaussian product distribution function (PDF) with 
means M and variance a 2 . 

[0056] H2: Y is drawn from a Gaussian PDF with mean 0 and variance a 2 . 
[0057] If it is supposed that HI or H2 is true, we wish to decide for HI or H2 with 
probability (1-0) or (1-ot) respectively, where a and p represent the error 
(misidentification) probabilities. 

[0058] From the theory described by Wald and Wolfowitz in "Optimum Character of 
the Sequential Probability Ratio Test," Ann. Math. Stat., 19,326 (1948), the most 
powerful test depends on the likelihood ratio l n , where 

Probability of observed sequence given HI true. 
Probability of observed sequence given H2 true. 

[0059] After n observations have been made, the sequential probability ratio is just the 
product of the probability ratio is just the product of the probability ratios for each step: 



l n =(PRl)-(PR2)-(PR3)-. . .(PR H ) 
or 
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. f{y\H\) 

L - n j , 

^f{y\H2) 

where Fiy^H) is the distribution of the random variable;;. 

[0060] the Wald-Wolfowitz theory operates as follows: 
[0061] Continue sampling as long as 

A<\ n <B (1) 
[0062] Stop sampling and decide HI as soon as 1^ SB, and stop sampling and decide 
H2 as soon as \ n The acceptance thresholds are related to the error (misidentification) 
probabilities by the following expressions: 

(2) 



l-a a 
where 

a = probability of accepting HI when H2 is true (false alarm probability) 

P = probability of accepting H2 when HI is true (missed alarm probability) 

[0063] Assuming the random variable y k is normally distributed, the likelihood that HI 

is true (mean M, variance o 2 ) is given by 



L{y\>y2>y*>—y n \ m )= 



1 



(2^) n/2 c7 : 



-exp 



1 



2a 2 



k=\ 



k=\ 



(3) 



Similarly for HI (mean o, variance a ), 



L{y l ,y 2 ,y 3 ,---y n \H2)= 



l 



\l7l) G 



exp 



2a 



1 " 

— Z-y* 2 



(4) 



[0064] The ratio of equations (3) and (4) gives the likelihood ratio l n ; where l n is 
expressed as 
-1 



/„ = exp 



2cr 2 k=l 



^M(m-2y k ) 



(5) 



combining equations 1, 2 and 5, and taking the natural logs, gives 
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ln[/? /(I -«)]<-^XM(M-2^)< ln[(l - f3)a] 
where 

5 P7?r = _±J M (M-2^)or 

then the sequential sampling and decision strategy can be concisely represented as 

If SPRT<ln(p/l-cc) accept H2 

Ifln[p/(l-a)]<SPRT<ln[(l-p)/a] 5 

continue sampling 

If SPRT>ln[(l-p)/a] accept HI. 

[0065] The SPRT analysis formulated here cannot be applied directly to non-Gaussian 
signals. For applications to nuclear system signals contaminated by non-Gaussian noise, 
an attempt should first be made to pretreat the input signals with a normalizing 
transformation. 

[0066] For applications where (a) one requires a high degree of assurance that a system 
is functioning within specifications and (b) there is not a large penalty associated with 
false alarms, it is not uncommon to specify a B (missed alarm probability) that is much 
smaller than A (false alarm probability). In safety critical systems one may be more 
willing to incur a false alarm than a missed alarm. For applications where a large cost 
penalty is incurred with any false alarms, it is desirable to keep both A and B small. 
[0067] The trade-off that must be considered before one specifies arbitrarily small 
values for A and B is the effect this may have on the sensitivity and maximum decision 
time needed by the SPRT to annunciate a disturbance. The desired sensitivity of the 
SPRT is fixed by specification of M, the system disturbance magnitude. For a given 
value of M, the average sample number required to reach a decision is influenced by A 
and B and also by the variance associated with the signals being monitored. It takes 
longer to identify a subtle change in a process characterized by a low signal-to-noise ratio 
than in one with a high signal-to-noise ratio. 



Atty. Dkt. No.: 051583-0238 
(ANL-IN-99-021) 



[0068] The non-stationary version of the SPRT algorithm is a slightly modified version 
of Wald's SPRT. In the non-stationary case, the failure magnitude, M, reference signal 

SPRT(n) = SPRT(n - 1) + ^M{y{n) - //(*)) - 

a {n)\ 2 J 

(or mean), (i, and the reference variance, &2, are sample dependent. Therefore, the non- 
stationary SPRT equation becomes 



where n = 1,2,. . ,,L and L is the length of the length equalized signals. In this case, y(n) 
is the length of the equalized monitored signal, \i(n) is the corresponding reference 
signal generated during the training phase and & 2 (n) is the variance of each point in \i(n). 

[0069] The bounded angle ratio test (hereinafter BART) mentioned above is employed 
in systems with more than two variables, as shown in FIG. 9. For example, BART can be 
used on an actual sensor signal exhibiting non-white characteristics, such as for example, 
on sensor signals from the primary pump #2 of the EBR-II nuclear reactor at Argonne 
National Laboratory (West) in Idaho. In such a case, the signal can be a measure of the 
pump's speed over a given amount of time. In such a situation, one can use a nonlinear 
multivariate regression technique that employs an N Dimensional Space (known in vector 
calculus terminology as hyperspace) to model the relationships between all of the 
variables. This regression procedure results in a nonlinear synthesized estimate for each 
input observation vector based on the hyperspace regression model. The nonlinear 
multivariate regression technique is centered around the hyperspace BART operator that 
determines the element by element and vector to vector relationships of the variables and 
observation vectors, given a set of system data that is recorded during a time period when 
everything is functioning correctly. 

[0070] In the BART method described in FIG. 9, the method is also split into a training 
phase and a monitoring phase. The first step in the training phase is to acquire a data 
matrix continuing data samples from all of the sensors (or data sources) used for 
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monitoring the system that are coincident in time and are representative of normal system 
operation. Then the BART parameters are calculated for each sensor (X med , X max and 

X . ). Here X . is the median value of a sensor. The next step is to determine the 

min 7 med r 

similarity domain height for each sensor (h) using the BART parameters X med , X max and 
X . . Once these parameters are calculated a subset of the data matrix is selected to 

min r 

create a model matrix (H) that is used in the BART estimation calculations. Here, H is an 
NxM matrix where N is the number of sensors being monitored and M is the number of 
observations stored from each sensor. The last steps taken during the training phase are 
the SPRT parameters calculations. The calculations are analogous to the calculations in 
the other methods, except that now the standard deviation value used to calculate SDI is 
obtained from BART estimation errors from each sensor (or data source) under normal 
operating conditions. 

[0071] During the BART monitoring phase, a sample vector is acquired at each time 
step t, that contains a reading from all of the sensors (or data sources) being used. Then 
the similarity angle (S A) between the sample vector and each sample vector stored in H 
is calculated. Next an estimate of the input sample vector Y is calculated using the 
BART estimation equations. The difference between the estimate and the actual sensor 
values is then used as input to the SPRT module. Each difference is treated separately so 
that a decision can be made on each sensor independently. This method is described in 
more detail hereinafter. 

[0072] In this preferred embodiment of FIG. 9 of the invention, the method measures 
similarity between scalar values. BART uses the angle formed by the two points under 
comparison and a third reference point lying some distance perpendicular to the line 
formed by the two points under comparison. By using this geometric and trigonometric 
approach, BART is able to calculate the similarity of scalars with opposite signs. 
[0073] In the most preferred form of BART an angle domain must be determined. The 
angle domain is a triangle whose tip is the reference point (R), and whose base is the 
similarity domain. The similarity domain consists of all scalars which can be compared 
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with a valid measure of similarity returned. To introduce the similarity domain, two 
logical functional requirements can be established: 

0 The similarity between the maximum and minimum values in the similarity 
domain is 0, 

and 

1 the similarity between equal values is 1 . 

Thus the similarity range (i.e. all possible values for a measure of similarity), is in the 
range 0 to 16 inclusive. 

[0074] BART also requires some prior knowledge of the numbers to be compared for 
determination of the reference point (R). Unlike a ratio comparison of similarity, BART 
does not allow "factoring out" in the values to be compared. For example, with the 
BART methodology the similarity between 1 and 2 is not necessarily equal to the 
similarity between 2 and 4. Thus, the location of R is vital for good relative similarities 
to be obtained. R lies over the similarity domain at some distance h, perpendicular to the 
domain. The location on the similarity domain at which R occurs (X med ) is related to the 

statistical distribution of the values to be compared. For most distributions, the median or 
mean is sufficient to generate good results. In a preferred embodiment the median is used 
since the median provides a good measure of data density and is resistant to skewing 
caused by large ranges of data. 

[0075] Once X med has been determined, it is possible to calculate h. In calculating h, it 

is necessary to know the maximum and minimum values in the similarity domain. (X max 

and X mjn respectively) for normalization purposes the angle between X mjn and X max is 

defined to be 90°. The conditions and values defined so far are illustrated in FIG. 10. 
From this triangle it is possible to obtain a system of equations and solve for h as shown 
below: 

c=X -X . 

med min 

d=x -x , 

max med 
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2 2 2 

a =c +h (19) 

i 2 -2 2 

b =d +h 

2 2 2 

(c+d) =a +b 

2 2 2 2 

(c+d) =c +d +2h 

h 2 =cd 

h = Vcd 

[0076] Once h has been calculated the system is ready to compute similarities. Assume 
that two points: X 0 and Xi (X 0 < Xi) are given as depicted in FIG. 1 1 and the similarity 
between the two is to be measured. The first step in calculating similarity is normalizing 
Xo and Xi with respect to X med . This is done by taking the euclidean distance between 

X med and each of the points to be compared. Once Xo and Xi have been normalized, the 

angle ZXoRXi (hereinafter designated 6) is calculated by the formula: 

6 = ArcTan(X! |h) = ArcTan(X 0 |h) (20) 

[0077] After 8 has been found, it must be normalized so that a relative measure of 
similarity can be obtained that lies within the similarity range. To ensure compliance 
with functional requirements (A) and (B) made earlier in this section, the relative 
similarity angle (SA) is given by: 

SA = l-— (21) 
90° 

[0078] Formula (21) satisfies both functional requirements established at the beginning 
of the section. The angle between X . andX was defined to be 90°, so the similarity 

mm max 7 J 

between X mjn and X max is 0. Also, the angle between equal values is 0°. The SA 
therefore will be confined to the interval between zero and one, as desired. 
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[0079] To measure similarity between two vectors using the BART methodology, the 
average of the element by element SAs are used. Given the vectors xi and xi the SA is 
found by first calculating Sj for i= 1,2,3. ..n for each pair of elements in xi and xi i.e., 

if f. = iXn Xn X» ■ ■ ■ X Jand ± = iX 2l X» X 2 y X J 
[0080] The vector SA V is found by averaging over the S^s and is given by the 
following equation: 




[0081] In general, when given a set of multivariate observation data from a process (or 
other source of signals), linear regression could be used to develop a process model that 
relates all of the variables in the process to one another. An assumption that must be 
made when using linear regression is that the cross-correlation information calculated 
from the process data is defined by a covariance matrix. When the cross-correlation 
between the process variables is nonlinear, or when the data are out of phase, the 
covariance matrix can give misleading results. The BART methodology is a nonlinear 
technique that measures similarity instead of the traditional cross-correlation between 
variables. One advantage of the BART method is that it is independent of the phase 
between process variables and does not require that relationships between variables be 
linear. 

[0082] If there is a random observation vector y and a known set of process observation 
vectors from a process P, it can be determined if y is a realistic observation from a 
process P by combining BART with regression to form a nonlinear regression method 
that looks at vector SAs as opposed to euclidean distance. If the know observation 
vectors taken from P are given by 
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fu V 



H = 



*31 



v h kiy 



Mm 
^2m 



= [*. h 2 



(23) 



where H is k by m (k being the number of variables and m the number of observations), 
then the closest realistic observation vector to y in process P given H is given by 
y = Hw (24) 



[0083] Here w is a weighting vector that maps a linear combination of the observation 
vectors in H to the most similar representation of y. The weighting vector w is calculated 
by combining the standard least squares equation form with BART. Here © stands for 
the SA operation used in BART. 

w = (H' ©H^H'ey (25) 



[0084] An example of use of the BART methodology was completed by using 10 EBR- 
II sensor signals. The BART system was trained using a training data set containing 
1440 observation vectors. Out of the 1440 observation vectors, 129 of these were chosen 
to be used to construct a system model. The 129 vectors were also used to determine the 
height, h, of the angle domain boundary as well as the location of the BART reference 
point R for each of the sensors used in the experiment. To test the accuracy of the model 
900 minutes of one minute data observation vectors under normal operating conditions 
were run through the BART system. The results of the BART system modeling accuracy 
are shown in FIGS. 12-16 and FIGS. 17-21 (BART modeled). The Mean Squared Errors 
for each of the sensor signals is shown in Table III. 
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TABLE III 


BART System Modeling Estimation Mean Squared Errors for EBR-II Sensor Signals 












Sensor 
Channel 


Sensor Description 


MSE of 
Estimation 
Error 


Normalized 

MSE 
(MSE/H 3 ) 


Normalized 
MSE 
(MSE/o 3 ) 


1. 


Primary Pump #1 Power (KW) 


0.0000190 


0.0000002 


0.0002957 


2. 


Primary Pump #2 Power (KW) 


0.0000538 


0.0000004 


0.0004265 


3. 


Primary Pump #1 Speed (RPM) 


0.0000468 


0.0000001 


0.0005727 


4. 


Primary Pump #2 Speed (RPM) 


0.0000452 


0.0000001 


0 0004571 


5. 


Reactor Outlet Flowrate (GPM) 


8.6831039 


0.0009670 


0.1352974 


6. 


Primary Pump #2 Flowrate (GPM) 


0.0571358 


0.0000127 


0.01.63304 


7. 


Subassembly Outlet Temperature 1A1 (F) 


0.0029000 


0.0000034 


0.0062368 


8. 


Subassembly Outlet Temperature 2B1 (F) 


0.0023966 


0.0000027 


0.0052941 


9. 


Subassembly Outlet Temperature 4E1 (F) 


0.0025957 


0.0000029 


0.0050805 


10. 


Subassembly Outlet Temperature 4F1 (F) 


0.0024624 


0.0000028 


0.00 1358 



[0085] A second example shows the results of applying BART to ten sensors signals 
with three different types of disturbances with their respective BART estimates 
superimposed followed by the SPRT results when applied to the estimation error signals. 
The first type of disturbance used in the experiment was a simulation of a linear draft in 
channel #1 . The drift begins at minute 500 and continues through to the end of the 
signal, reaching a value of 0.21% of the sensor signal magnitude and the simulation is 
shown in FIG. 22A. The SPRT (FIG. 2B) detects the drift after it has reached a value of 
approximately 0.06% of the signal magnitude. In FIG. 22C a simulation of a step failure 
in channel #2 is shown. Here the step has a height of 0.26% of the signal magnitude and 
begins at minute 500 and continues throughout the signal. FIG. 22D shows the SPRT 
results for the step failure. The SPRT detects the failure immediately after it was 
introduced into the signal. The last simulation was that of a sinusoidal disturbance 
introduced into channel #6 as shown in FIG. 22E. The sinusoid starts at minute 500 and 
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continues throughout the signal with a constant amplitude of 0.15% of the sensor signal 
magnitude. The SPRT results for this type of disturbance are shown in FIG. 22F. Again 
the SPRT detects the failure even though the sinusoid's amplitude is within the operating 
range of the channel #6 sensor signal. 

[0086] While preferred embodiments have been shown and described, it should be 
understood that changes and modifications can be made therein without departing from 
the invention in its broader aspects. For example, it is possible that signals or waveforms 
could be measured from processes other than those in the manufacturing or biological 
fields. Additionally, there are many comparison techniques that could be used to 
correlate and compare the signals measured according to this invention. Various features 
of the invention are defined in the following Claims. 



